METHOD OF PURIFYING HETEROLOGOUS PROTEINS

This application claims the benefit of a previously filed Provisional
Application No. 60/129^99, filed April 14, 1999, the contents of which are
incorporated in their entirety.

Background of the Invention

Milk from domestic animals has been used as a source of proteins and
other products for the food and pharmaceutical industries for many years. Among
ruminants and laboratory animals, milk contains an average of 30 to 140 grams of
protein per liter, or about 4-17% by weight, depending on the species. The bulk of
these proteins are caseins, which are complexed with calcium and phosphate in
supramolecular structures known as micelles. The other major class of milk
proteins is whey proteins, predominantly comprised of beta-lactoglobulin and
alpha-lactalbumin, but also including lactoferrin, immunoglobulins, and serum
albumin.

More recently, methods have become known in the art for
expressing exogenous proteins at commercially feasible levels in the milk of
transgenic animals. Exogenous protein expression levels generally range from
less than 1 to 10 or more grams per liter, depending on the protein and the species.
Commercial production of a wide range of proteins in the milk of transgenic
livestock is now under development (A. J. Clark, et al., Trends in Biotechnology,
5:20-24, 1987; A. J. Clark, Journal of Cellular Biochemistry, 49:121, 1992;
W. Bawden et al., Biotechnology and Genetic Engineering Reviews, 12:89-137,
1994; N. S. Rudolph, Genetic Engineering News, 15:8-9, 1995). Exogenous
peptides, and in particular human peptides, may be produced in milk at relatively
high concentrations and in large volumes, providing continuous high-level output
of normally processed peptides.

Although it is possible to produce high levels of exogenous peptides in the
mammary gland, the milk also contains significant levels of the animal's own
serum proteins. It has been shown that serum proteins are present in milk at 2%
of their serum levels. Thus, methods are needed which allow for separation of
recombinant proteins secreted into the milk of a transgenic animal from an
animal's own serum proteins.

Summary of the Invention

The present invention is based, in part, on the discovery of a separation
method that can distinguish between two almost identical versions of a protein,
e.g., a human protein and the protein of a host cell, e.g., of a transgenic host, e.g.,
a dairy animal host. It was found that the host protein can be separated from a
heterologous protein produced in the milk of the animal by altering the host
endogenous protein.

It is often necessary to purify a heterologous protein free of the
animal's version of the protein. Generally, it is estimated that the host serum
proteins are found in milk at 2% of their serum level. For many other proteins, the
levels are in the microgram range. For example, goat antithrombin III (ATIII)
levels in milk are less than 3 µg/ml. It is, however, necessary to remove the host
animal's version of the protein to levels far below that level. In addition, many
proteins have a high level of homology across species barriers which can impede
methods of separating a host animal's version of a protein and a heterologous
protein expressed by that animal. For example, human serum albumin (HSA) is
very similar to bovine serum albumin (BSA). It was found that by altering the
host animal's version of the protein, a heterologous protein, e.g., a heterologous
protein secreted into the milk of the animal, can be distinguished from and
separated from the animal's endogenous protein.
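
The 2% estimate lends itself to a quick back-of-the-envelope calculation. The
following Python sketch is illustrative only: the function names and the input
values (a serum concentration of 35 g/L and a heterologous expression level of
5 g/L) are hypothetical placeholders rather than figures from this application;
only the 2% milk-to-serum ratio is taken from the text.

```python
def host_protein_in_milk(serum_g_per_l, milk_fraction=0.02):
    """Estimate the host serum protein concentration in milk (g/L).

    milk_fraction is the 2% milk-to-serum ratio cited in the text.
    """
    return serum_g_per_l * milk_fraction


def host_share(host_g_per_l, heterologous_g_per_l):
    """Fraction of the combined pool contributed by the host protein."""
    return host_g_per_l / (host_g_per_l + heterologous_g_per_l)


# Hypothetical inputs, chosen only to illustrate the arithmetic.
host = host_protein_in_milk(35.0)
share = host_share(host, 5.0)
print(f"host protein in milk: {host:.2f} g/L")
print(f"host share of combined pool: {share:.1%}")
```

Under these assumed inputs the host protein would make up roughly an eighth of
the combined pool, which is why a near-identical host protein cannot simply be
ignored during purification.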
Accordingly, in one aspect, the invention features a method of
providing a heterologous polypeptide expressed by a transgenic mammal. The
method includes: providing a transgenic mammal which includes a transgenic
sequence encoding the heterologous polypeptide and which expresses an altered
form of the protein naturally expressed by the transgenic mammal; and, obtaining
a sample from the transgenic mammal, wherein the sample comprises the
heterologous polypeptide and the altered protein, to thereby provide the
heterologous polypeptide.

In a preferred embodiment, the method can further include
separating the altered protein from the sample, e.g., a tissue or a fluid (e.g., milk,
blood, urine).

In a preferred embodiment, the transgenic mammal is homozygous
for the gene which encodes the altered protein. In another preferred embodiment,
the transgenic animal is heterozygous for the gene which encodes the altered
protein, e.g., the altered protein is a dominantly expressed protein.

In a preferred embodiment, the protein naturally expressed in the
transgenic mammal is altered by replacing the gene encoding the protein with a
DNA sequence encoding the heterologous polypeptide, e.g., a cDNA sequence or
a genomic sequence encoding the heterologous polypeptide. In a preferred
embodiment, a cDNA encoding the heterologous polypeptide is inserted prior to
the initiation codon of the gene encoding the naturally expressed protein with or
without removing the sequence coding for the naturally expressed protein from
the genome. If the sequence coding for the naturally expressed protein is not
removed, the signal sequence of the protein can be removed and/or a termination
sequence can be inserted at the 3' end of the cDNA sequence encoding the
heterologous polypeptide. Thus, replacement means "replacement of expression"
and is not linked to an "exchange" of one gene for another in the genome.

In another preferred embodiment, the gene encoding the naturally
expressed protein is replaced with a DNA sequence, e.g., the genomic sequence,
encoding the heterologous polypeptide, e.g., the sequence encoding the naturally
expressed protein is removed and replaced with, e.g., the genomic sequence,
which encodes the heterologous polypeptide. In a preferred embodiment, the
sequence encoding the heterologous protein is inserted: after the 5' untranslated
region (UTR) of the naturally expressed protein; prior to the 3' UTR of the
naturally expressed protein; in between the 5' UTR and the 3' UTR of the
naturally expressed protein.

In a preferred embodiment, the protein naturally expressed in the
transgenic mammal is altered by changing the sequence of the gene, e.g., by
adding, deleting, or substituting a nucleotide, such that the protein is changed.
The protein can be changed by, for example, deleting at least one amino acid,
adding at least one amino acid, or the substitution of at least one amino acid. A
nucleotide can be added at the C-terminus, the N-terminus, or at internal points
within the gene, e.g., to achieve the change in the protein. In a preferred
embodiment, the protein naturally expressed in the transgenic mammal is changed
by adding at least one amino acid to the protein, e.g., an added amino acid or
amino acids that can bind to a preselected ligand, e.g., a 6X HIS ligand, a
cellulose binding domain (CBD) ligand, a maltose binding protein (MBP) ligand.
In a preferred embodiment, the protein naturally expressed in the transgenic
mammal is changed by addition of an affinity tag to the gene encoding the
naturally expressed protein. In a preferred embodiment, more than 1-100, 5-50,
10-30 amino acid residues can be added to the protein. The residues allow for a
physical difference between the altered protein and the transgenic product and are
useful, e.g., in purification.

In a preferred embodiment, the protein naturally expressed in the
transgenic mammal is altered by changing the gene encoding the protein such that
the stop codon of the protein is altered, e.g., the stop codon is altered by
substitution of at least one nucleotide in the stop codon. In another preferred
embodiment, the stop codon is altered by removal of at least one nucleotide or
addition of at least one nucleotide in the sequence encoding the stop codon.
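
As an illustration of why altering the stop codon changes the host protein, the
following Python sketch translates a toy coding sequence before and after a
single-nucleotide substitution in its stop codon. The sequences are hypothetical
and are not taken from any gene discussed here; the downstream histidine codons
are an assumption made so that the read-through product carries a recognizable
extension. The codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate from the first base until a stop codon or the end."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Hypothetical toy gene: coding region, TAA stop, then downstream sequence.
coding = "ATGGCTAAA"
downstream = "CATCACCATCACCATCACTGA"
native = coding + "TAA" + downstream
# Substitute the first nucleotide of the stop codon (TAA -> CAA).
altered = coding + "CAA" + downstream

print(translate(native))   # MAK
print(translate(altered))  # MAKQHHHHHH
```

In this toy case the substitution turns the stop codon into a sense codon, so
translation continues to the next in-frame stop and the altered protein is
longer than the native one.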

In a preferred embodiment, the protein naturally expressed in the
transgenic mammal is altered by changing an amino acid sequence of the protein
such that the stop codon is altered, e.g., the stop codon is altered by addition of at
least one amino acid. In a preferred embodiment, the amino acid is added using a
suppressor tRNA and a transferase enzyme. In a preferred embodiment, the
transgenic animal includes one or more of: a transgenic sequence encoding a
suppressor tRNA; a transgenic sequence encoding a transferase enzyme; a
transgenic sequence encoding a suppressor tRNA and a transgenic sequence
encoding a transferase enzyme; a transgenic sequence encoding a suppressor tRNA
under the control of a tissue-specific promoter; a transgenic sequence encoding a
transferase enzyme under the control of a tissue-specific promoter.

In a preferred embodiment, the transgenic mammal is selected from
the group consisting of a goat, a bovine, a sheep, a pig, a horse, a mouse, and a rat.
Preferably, the transgenic mammal is a bovine or a goat.

In a preferred embodiment, the heterologous polypeptide is a
human polypeptide. In a preferred embodiment, the heterologous polypeptide is:
a serum protein, a milk protein, a glycosylated or a nonglycosylated protein. A
heterologous polypeptide includes, but is not limited to, any of: alpha proteinase
inhibitor, alkaline phosphatase, angiogenin, extracellular superoxide dismutase,
fibrinogen, glucocerebrosidase, glutamate decarboxylase, human serum albumin,
myelin basic protein, proinsulin, soluble CD4, lactoferrin, lactoglobulin,
lysozyme, lactoalbumin, erythropoietin, tissue plasminogen activator, human
growth factor, antithrombin III, insulin, prolactin, and alpha-antitrypsin.

In a preferred embodiment, the heterologous sequence is under the
control of a promoter, e.g., a tissue-specific promoter (e.g., a milk-specific
promoter, a blood-specific promoter, or a urine-specific promoter). A
milk-specific promoter can be any of: a casein promoter, a beta lactoglobulin
promoter, a whey acid protein promoter, and a lactalbumin promoter.

In another aspect, the invention features a method of providing a
heterologous polypeptide expressed by a transgenic mammal. The method
includes: providing a transgenic mammal which includes a transgenic sequence
encoding the heterologous polypeptide and which expresses an altered form of the
protein naturally expressed by the transgenic mammal, wherein the gene encoding
the naturally expressed protein is replaced with the sequence encoding the
heterologous polypeptide; and obtaining a sample from the transgenic mammal,
wherein the sample includes the heterologous polypeptide and the altered protein,
to thereby provide a heterologous polypeptide.

In a preferred embodiment, the method further includes separating
the altered protein from the sample, e.g., a tissue or fluid (e.g., milk, blood, urine).

In a preferred embodiment, the transgenic mammal is homozygous
for the gene which encodes the altered protein.

In a preferred embodiment, the protein naturally expressed in the
transgenic mammal is altered by replacing the gene encoding the protein with a
DNA sequence encoding the heterologous polypeptide, e.g., a cDNA sequence or
a genomic sequence encoding the heterologous polypeptide. In a preferred
embodiment, a cDNA encoding the heterologous polypeptide is inserted prior to
the initiation codon of the gene encoding the naturally expressed protein with or
without removing the sequence coding for the naturally expressed protein from
the genome. If the sequence encoding the naturally expressed protein is not
removed from the genome, the signal sequence of the protein can be removed
and/or a termination sequence can be inserted at the 3' end of the cDNA sequence
encoding the heterologous polypeptide. Thus, replacement means "replacement of
expression" and is not linked to an "exchange" of one gene for another in the
genome.

In another preferred embodiment, the gene encoding the naturally
expressed protein is replaced with a DNA sequence, e.g., the genomic sequence,
encoding the heterologous polypeptide, e.g., the sequence encoding the naturally
expressed protein is removed and replaced with, e.g., the genomic sequence of the
heterologous polypeptide. In a preferred embodiment, the sequence encoding the
heterologous protein is inserted: after the 5' untranslated region (UTR) of the
naturally expressed protein; prior to the 3' UTR of the naturally expressed protein;
in between the 5' UTR and the 3' UTR of the naturally expressed protein.

In a preferred embodiment, the transgenic mammal is selected from
the group consisting of a goat, a bovine, a sheep, a pig, a horse, a mouse, and a rat.
Preferably, the transgenic mammal is a bovine or a goat.

In a preferred embodiment, the heterologous polypeptide is a
human polypeptide. In a preferred embodiment, the heterologous polypeptide is:
a serum protein, a milk protein, a glycosylated or a nonglycosylated protein. A
heterologous polypeptide includes, but is not limited to, any of: alpha proteinase
inhibitor, alkaline phosphatase, angiogenin, extracellular superoxide dismutase,
fibrinogen, glucocerebrosidase, glutamate decarboxylase, human serum albumin,
myelin basic protein, proinsulin, soluble CD4, lactoferrin, lactoglobulin,
lysozyme, lactoalbumin, erythropoietin, tissue plasminogen activator, human
growth factor, antithrombin III, insulin, prolactin, and alpha-antitrypsin.

In a preferred embodiment, the heterologous sequence is under the 
control of a promoter, e.g., a tissue-specific promoter (e.g., a milk-specific 
promoter, a blood-specific promoter, or a urine-specific promoter). A 
milk-specific promoter can be any of: a casein promoter, a beta lactoglobulin
promoter, a whey acid protein promoter, and a lactalbumin promoter.

In another aspect, the invention features a method of providing a 
heterologous polypeptide expressed by a transgenic mammal. The method
includes: providing a transgenic mammal which includes a transgenic sequence
encoding the heterologous polypeptide and which expresses an altered form of the
protein naturally expressed by the transgenic mammal, wherein the gene encoding
the naturally expressed protein is modified, e.g., by addition, deletion, or
substitution of at least one nucleotide, such that the amino acid sequence is
altered, e.g., by the addition, deletion, or substitution of at least one amino acid;
and obtaining a sample from the transgenic mammal, wherein the sample includes
the heterologous polypeptide and the altered protein, to thereby provide the
heterologous polypeptide.

In a preferred embodiment, the method further includes separating
the altered protein from the sample, e.g., a tissue or a fluid (e.g., milk, blood,
urine).

In a preferred embodiment, the transgenic mammal is homozygous 
for the gene which encodes the altered protein. 

In a preferred embodiment, the protein naturally expressed in the
transgenic mammal is altered by changing the sequence of the gene, e.g., by
adding, deleting, or substituting a nucleotide, such that the protein is changed.
The protein can be changed by, for example, deleting at least one amino acid,
adding at least one amino acid, or the substitution of at least one amino acid. A
nucleotide can be added at the C-terminus, the N-terminus, or at internal points
within the gene, e.g., to achieve the change in the protein. In a preferred
embodiment, the protein naturally expressed in the transgenic mammal is changed
by adding at least one amino acid to the protein, e.g., an added amino acid or
amino acids that can bind to a preselected ligand, e.g., a 6X HIS ligand, a CBD
ligand, a MBP ligand. In a preferred embodiment, the altered protein is separated
from the sample by binding the altered protein to a preselected ligand, e.g., a 6X
HIS ligand (e.g., a metal chelating column), CBD ligand (e.g., cellulose) or a
MBP ligand.

In a preferred embodiment, more than 1-100, 5-50, 10-30 amino acid
residues can be added. The residues allow for a physical difference between the
altered protein and the transgenic product and are useful, e.g., in purification.

In a preferred embodiment, the protein naturally expressed in the
transgenic mammal is changed by addition of an affinity tag to the gene encoding
the naturally expressed protein.
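
The tag-based separation can be pictured with a toy model. The Python sketch
below treats a sample as a set of named amino acid sequences and partitions it
by the presence of a run of six histidines, standing in for binding to a metal
chelating column. The sequences, names, and the split_by_his_tag helper are
hypothetical; real chromatography depends on tag accessibility, buffer
conditions, and resin chemistry that this model ignores.

```python
import re

# Six consecutive histidines, i.e. a 6X HIS tag in one-letter code.
HIS_TAG = re.compile(r"H{6}")


def split_by_his_tag(sample):
    """Partition {name: sequence} into (bound, flow_through) dicts."""
    bound, flow_through = {}, {}
    for name, sequence in sample.items():
        target = bound if HIS_TAG.search(sequence) else flow_through
        target[name] = sequence
    return bound, flow_through


# Hypothetical placeholder sequences, not real protein sequences.
sample = {
    "altered_host_protein": "MKWVTFISLLHHHHHH",
    "heterologous_protein": "MKWVTFISLLFLFSSA",
}
bound, flow_through = split_by_his_tag(sample)
print(sorted(bound))         # ['altered_host_protein']
print(sorted(flow_through))  # ['heterologous_protein']
```

In this picture the tagged host protein is retained on the ligand, and the
untagged heterologous polypeptide is recovered from the flow-through.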

In a preferred embodiment, the protein naturally expressed in the
transgenic mammal is altered by changing the gene encoding the protein such that
the stop codon of the protein is altered, e.g., the stop codon is altered by
substitution of at least one nucleotide in the stop codon. In another preferred
embodiment, the stop codon is altered by removal of at least one nucleotide or
addition of at least one nucleotide in the sequence encoding the stop codon.

In a preferred embodiment, the transgenic mammal is selected from the
group consisting of a goat, a bovine, a sheep, a pig, a horse, a mouse, and a rat.
Preferably, the transgenic mammal is a bovine or a goat.

In a preferred embodiment, the heterologous polypeptide is a
human polypeptide. In a preferred embodiment, the heterologous polypeptide is:
a serum protein, a milk protein, a glycosylated or a nonglycosylated protein. A
heterologous polypeptide includes, but is not limited to, any of: alpha proteinase
inhibitor, alkaline phosphatase, angiogenin, extracellular superoxide dismutase,
fibrinogen, glucocerebrosidase, glutamate decarboxylase, human serum albumin,
myelin basic protein, proinsulin, soluble CD4, lactoferrin, lactoglobulin,
lysozyme, lactoalbumin, erythropoietin, tissue plasminogen activator, human
growth factor, antithrombin III, insulin, prolactin, and alpha-antitrypsin.

In a preferred embodiment, the heterologous sequence is under the
control of a promoter, e.g., a tissue-specific promoter (e.g., a milk-specific
promoter, a blood-specific promoter, or a urine-specific promoter). A
milk-specific promoter can be any of: a casein promoter, a beta lactoglobulin
promoter, a whey acid protein promoter, and a lactalbumin promoter.

In another aspect, the invention features a method of providing a
heterologous polypeptide expressed by a transgenic mammal. The method
includes: providing a transgenic mammal which includes a transgenic sequence
encoding the heterologous polypeptide and which expresses an altered form of the
protein naturally expressed by the transgenic mammal, wherein the naturally
expressed protein is altered such that the stop codon in the protein is altered, e.g.,
by addition, deletion or substitution of an amino acid to the stop codon; and
obtaining a sample from the transgenic mammal, wherein the sample includes the
heterologous polypeptide and the altered protein, to thereby provide the
heterologous polypeptide.

In a preferred embodiment, the transgenic mammal is homozygous
for the gene which encodes the altered protein. In another preferred embodiment,
the transgenic animal is heterozygous for the gene which encodes the altered
protein, e.g., the altered protein is a dominantly expressed protein.

In a preferred embodiment, the method further includes separating
the altered protein from the sample, e.g., a tissue or a fluid (e.g., milk, blood or
urine).

In a preferred embodiment, the stop codon is altered by addition of
at least one amino acid to the stop codon. In a preferred embodiment, the amino
acid is added using a suppressor tRNA and a transferase enzyme. In a preferred
embodiment, the transgenic animal includes one or more of: a transgenic sequence
encoding a suppressor tRNA; a transgenic sequence encoding a transferase
enzyme; a transgenic sequence encoding a suppressor tRNA and a transgenic
sequence encoding a transferase enzyme; a transgenic sequence encoding a
suppressor tRNA under the control of a tissue-specific promoter; a transgenic
sequence encoding a transferase enzyme under the control of a tissue-specific
promoter.
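
A toy model may help picture how a suppressor tRNA adds amino acids at a stop
codon, and why tissue-specific expression of the suppressor matters. In the
Python sketch below, a stop codon is read as an amino acid only when the
suppressor is marked as present; translation then continues to the next stop
codon. The sequence, the choice of the TAG codon, the inserted residue, and the
tissue labels are all assumptions made for illustration and do not describe any
construct in this application.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, suppressed_codon=None, inserted_residue="Q"):
    """Translate dna; read suppressed_codon as inserted_residue if given."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            if codon != suppressed_codon:
                break  # ordinary termination
            aa = inserted_residue  # suppressor reads through this stop
        protein.append(aa)
    return "".join(protein)


# Hypothetical host gene: ends in TAG, followed by in-frame codons and TAA.
host_gene = "ATGGCTAAATAGGGTTCTTAA"
# Hypothetical tissues: the suppressor is present only where its promoter is active.
suppressor_active = {"tissue_with_suppressor": True, "tissue_without": False}

for tissue, active in suppressor_active.items():
    protein = translate(host_gene, "TAG" if active else None)
    print(tissue, protein)
```

Under these assumptions the host protein is extended only in the tissue where
the suppressor is expressed, which gives the altered protein a physical
difference from the heterologous polypeptide.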
In a preferred embodiment, the transgenic mammal is selected from
the group consisting of a goat, a bovine, a sheep, a pig, a horse, a mouse, and a rat.
Preferably, the transgenic mammal is a bovine or a goat.

In a preferred embodiment, the heterologous polypeptide is a
human polypeptide. In a preferred embodiment, the heterologous polypeptide is:
a serum protein, a milk protein, a glycosylated or a nonglycosylated protein. A
heterologous polypeptide includes, but is not limited to, any of: alpha proteinase
inhibitor, alkaline phosphatase, angiogenin, extracellular superoxide dismutase,
fibrinogen, glucocerebrosidase, glutamate decarboxylase, human serum albumin,
myelin basic protein, proinsulin, soluble CD4, lactoferrin, lactoglobulin,
lysozyme, lactoalbumin, erythropoietin, tissue plasminogen activator, human
growth factor, antithrombin III, insulin, prolactin, and alpha-antitrypsin.

In a preferred embodiment, the heterologous sequence is under the 
control of a promoter, e.g., a tissue-specific promoter (e.g., a milk-specific 
promoter, a blood-specific promoter, or a urine-specific promoter). A
milk-specific promoter can be any of: a casein promoter, a beta lactoglobulin
promoter, a whey acid protein promoter, and a lactalbumin promoter. 



In another aspect, the invention features a method of providing
human serum albumin expressed by a transgenic mammal. The method includes:
providing a transgenic mammal which includes a transgene encoding human
serum albumin and which expresses an altered form of the serum albumin
naturally expressed by the transgenic mammal; and, obtaining a sample from the
transgenic mammal, wherein the sample includes human serum albumin and the
altered serum albumin, to thereby provide human serum albumin.

In a preferred embodiment, the method further includes separating
the altered serum albumin from the sample, e.g., milk, blood, urine.

In a preferred embodiment, the transgenic mammal is homozygous
for the gene which encodes the altered serum albumin. In another preferred
embodiment, the transgenic animal is heterozygous for the gene which encodes
the altered serum albumin, e.g., the altered serum albumin is a dominantly
expressed protein.

In a preferred embodiment, the serum albumin naturally expressed
in the transgenic mammal is altered by replacing the gene encoding the serum
albumin with a DNA sequence encoding human serum albumin, e.g., a cDNA
sequence or a genomic sequence encoding human serum albumin. In a preferred
embodiment, a cDNA encoding human serum albumin is inserted prior to the
initiation codon of the gene encoding the naturally expressed serum albumin with
or without removing the sequence coding for the naturally expressed serum
albumin from the genome. If the sequence encoding the naturally expressed
serum albumin is not removed, the signal sequence of the serum albumin can be
removed and/or a termination sequence can be added to the 3' end of the cDNA
encoding human serum albumin. Thus, replacement means "replacement of
expression" and is not linked to an "exchange" of one gene for another in the
genome.

In another preferred embodiment, the gene encoding the naturally
expressed serum albumin is replaced with a DNA sequence, e.g., the genomic
sequence, encoding human serum albumin, e.g., the sequence encoding the
naturally expressed serum albumin is removed and replaced with, e.g., the
genomic sequence of human serum albumin. In a preferred embodiment, the
sequence encoding the human serum albumin is inserted: after the 5' untranslated
region (UTR) of the naturally expressed serum albumin; prior to the 3' UTR of the
naturally expressed serum albumin; in between the 5' UTR and the 3' UTR of the
naturally expressed serum albumin.

In a preferred embodiment, the serum albumin naturally expressed in the
transgenic mammal is altered by changing the sequence of the gene, e.g., by
adding, deleting, or substituting a nucleotide, such that the serum albumin is
changed. The serum albumin can be changed by, for example, deleting at least
one amino acid, adding at least one amino acid, or the substitution of at least one
amino acid. A nucleotide can be added at the C-terminus, the N-terminus, or at
internal points within the gene, e.g., to achieve the change in the protein. In a
preferred embodiment, the serum albumin naturally expressed in the transgenic
mammal is changed by adding at least one amino acid to the protein, e.g., an
added amino acid or amino acids that can bind to a preselected ligand, e.g., a 6X
HIS ligand, a cellulose binding domain (CBD) ligand, a maltose binding protein
(MBP) ligand. In a preferred embodiment, the altered serum albumin is separated
from the sample by binding the altered serum albumin to a preselected ligand,
e.g., a 6X HIS ligand (e.g., a metal chelating column), CBD ligand (e.g.,
cellulose) or a MBP ligand.

In a preferred embodiment, more than 1-100, 5-50, 10-30 amino acid
residues can be added. The residues allow for a physical difference between the
altered serum albumin and the transgenic product and are useful, e.g., in
purification.

In a preferred embodiment, the serum albumin naturally expressed in the
transgenic mammal is changed by addition of an affinity tag to the gene encoding
the naturally expressed protein.

In a preferred embodiment, the serum albumin naturally expressed
in the transgenic mammal is altered by changing the gene encoding the protein
such that the stop codon of the serum albumin is altered, e.g., the stop codon is
altered by substitution of at least one nucleotide in the stop codon. In another
preferred embodiment, the stop codon is altered by removal of at least one
nucleotide or addition of at least one nucleotide in the sequence encoding the stop
codon.

In a preferred embodiment, the serum albumin naturally expressed
in the transgenic mammal is altered by changing an amino acid sequence of the
protein such that the stop codon is altered, e.g., the stop codon is altered by
addition of at least one amino acid. In a preferred embodiment, the amino acid is
added using a suppressor tRNA and a transferase enzyme. In a preferred
embodiment, the transgenic animal includes one or more of: a transgenic sequence
encoding a suppressor tRNA; a transgenic sequence encoding a transferase
enzyme; a transgenic sequence encoding a suppressor tRNA and a transgenic
sequence encoding a transferase enzyme; a transgenic sequence encoding a
suppressor tRNA under the control of a tissue-specific promoter, e.g., a
liver-specific promoter; a transgenic sequence encoding a transferase enzyme
under the control of a tissue-specific promoter, e.g., a liver-specific promoter.

In a preferred embodiment, the transgenic mammal is selected from
the group consisting of a goat, a bovine, a sheep, a pig, a horse, a mouse, and a rat.
Preferably, the transgenic mammal is a bovine or a goat.

In a preferred embodiment, the human serum albumin sequence is
under the control of a promoter, e.g., a tissue-specific promoter, e.g., a
milk-specific promoter, a blood-specific promoter, or a urine-specific promoter.
A milk-specific promoter can be any of: a casein promoter, a beta lactoglobulin
promoter, a whey acid protein promoter, and a lactalbumin promoter.



In another aspect, the invention features a method of providing
human serum albumin expressed by a transgenic bovine. The method includes:
providing a transgenic bovine which includes a transgene encoding human serum
albumin and which expresses an altered form of bovine serum albumin; obtaining
a sample from the transgenic bovine, wherein the sample includes human serum
albumin and the altered bovine serum albumin, to thereby provide human serum
albumin.

In a preferred embodiment, the method further includes separating
the altered bovine serum albumin from the sample, e.g., milk, blood, urine.

In a preferred embodiment, the transgenic bovine is homozygous
for the gene which encodes the altered bovine serum albumin. In another
preferred embodiment, the transgenic bovine is heterozygous for the gene which
encodes the altered bovine serum albumin, e.g., the altered bovine serum albumin
is a dominantly expressed protein.

In a preferred embodiment, the bovine serum albumin is altered by
replacing the gene encoding bovine serum albumin with a DNA sequence
encoding human serum albumin, e.g., a cDNA sequence or a genomic sequence
encoding the human serum albumin. In a preferred embodiment, a cDNA
encoding human serum albumin is inserted prior to the initiation codon of the
gene encoding bovine serum albumin with or without removing the sequence
coding for the naturally expressed serum albumin from the genome. In a preferred
embodiment, the cDNA encoding human serum albumin is inserted into exon 2 of
the bovine serum albumin gene. In another preferred embodiment, the signal
sequence can be removed from the bovine serum albumin and/or a termination
sequence, e.g., a bovine polyA termination sequence, can be inserted onto the 3'
end of the cDNA encoding human serum albumin. Thus, replacement means
"replacement of expression" and is not linked to an "exchange" of one gene for
another in the genome.

In another preferred embodiment, the gene encoding bovine serum
albumin is replaced with a DNA sequence, e.g., the genomic sequence, encoding
human serum albumin, e.g., the sequence encoding the bovine serum albumin is
removed and replaced with, e.g., the genomic sequence, of human serum albumin.
In a preferred embodiment, the genomic sequence encoding human serum
albumin is inserted: after the 5' untranslated region (UTR) of bovine serum
albumin; prior to the 3' UTR of bovine serum albumin; in between the 5' UTR
and the 3' UTR of bovine serum albumin.

In a preferred embodiment, the bovine serum albumin is altered by
changing the sequence of the gene, e.g., by adding, deleting, or substituting a
nucleotide, such that the bovine serum albumin is changed. The bovine serum
albumin can be changed by, for example, deleting at least one amino acid, adding
at least one amino acid, or the substitution of at least one amino acid. A
nucleotide can be added at the C-terminus, the N-terminus, or at internal points
within the gene, e.g., to achieve the change in the bovine serum albumin. In a
preferred embodiment, the bovine serum albumin is changed by adding at least
one amino acid to the protein, e.g., an added amino acid or amino acids that can
bind to a preselected ligand, e.g., a 6X HIS ligand, a cellulose binding domain
(CBD) ligand, a maltose binding protein (MBP) ligand. In a preferred
embodiment, the altered bovine serum albumin is separated from the sample by
binding the altered bovine serum albumin to a preselected ligand, e.g., a 6X HIS
ligand (e.g., a metal chelating column), CBD ligand (e.g., cellulose) or a MBP
ligand (e.g., maltose).

In a preferred embodiment, more than 1-100, 5-50, 10-30 amino acid
residues can be added. The residues allow for a physical difference between the
altered bovine serum albumin and the human serum albumin and are useful, e.g.,
in purification.

In a preferred embodiment, the bovine serum albumin is changed by
addition of an affinity tag to the gene encoding the naturally expressed protein.

In a preferred embodiment, the bovine serum albumin is altered by
changing the gene encoding the protein such that the stop codon of the bovine
serum albumin is altered, e.g., the stop codon is altered by substitution of at least
one nucleotide in the stop codon. In another preferred embodiment, the stop
codon is altered by removal of at least one nucleotide or addition of at least one
nucleotide in the sequence encoding the stop codon, e.g., removal of at least one
nucleotide of the TAA stop codon of bovine serum albumin.

In a preferred embodiment, the bovine serum albumin is altered by
changing an amino acid sequence of the protein such that the stop codon is
altered, e.g., the stop codon is altered by addition of at least one amino acid. In a
preferred embodiment, the amino acid is added using a suppressor tRNA and a
transferase enzyme. In a preferred embodiment, the transgenic animal includes
one or more of: a transgenic sequence encoding a suppressor tRNA; a transgenic
sequence encoding a transferase enzyme; a transgenic sequence encoding a
suppressor tRNA and a transgenic sequence encoding a transferase enzyme; a
transgenic sequence encoding a suppressor tRNA under the control of a
tissue-specific promoter, e.g., a liver-specific promoter; a transgenic sequence
encoding a transferase enzyme under the control of a tissue-specific promoter,
e.g., a liver-specific promoter. In a preferred embodiment, the suppressor tRNA
recognizes at least 2, 3, 4, 5, 6 base codons, e.g., the suppressor tRNA recognizes
the TAAAC sequence at the end of the bovine serum albumin coding region.

In a preferred embodiment, the human serum albumin sequence is
under the control of a promoter, e.g., a tissue-specific promoter, e.g., a
milk-specific promoter, a blood-specific promoter, or a urine-specific promoter.
A milk-specific promoter can be any of: a casein promoter, a beta lactoglobulin
promoter, a whey acid protein promoter, and a lactalbumin promoter.

In another aspect, the invention features a method of providing a
heterologous polypeptide expressed by a host cell. The method includes:
providing a cell which includes a nucleotide sequence encoding the heterologous
polypeptide and which expresses an altered form of the protein naturally
expressed by the cell; and, obtaining a sample from the cell, wherein the sample
comprises the heterologous polypeptide and the altered protein, to thereby provide
the heterologous polypeptide.

In a preferred embodiment, the method can further include
separating the altered protein from the sample, e.g., separating the altered protein
from the culture media.

In a preferred embodiment, the cell is homozygous for the gene
which encodes the altered protein. In another preferred embodiment, the cell is
heterozygous for the gene which encodes the altered protein, e.g., the altered
protein is a dominantly expressed protein.

In a preferred embodiment, the protein naturally expressed in the
cell is altered by replacing the gene encoding the protein with a DNA
encoding the heterologous polypeptide, e.g., a cDNA sequence or a genomic
sequence encoding the heterologous polypeptide. In a preferred embodiment, a
cDNA encoding the heterologous polypeptide is inserted prior to the initiation
codon of the gene encoding the naturally expressed protein with or without
removing the sequence coding for the naturally expressed protein from the
genome. If the sequence coding for the naturally expressed protein is not
removed, the signal sequence of the protein can be removed and/or a termination
sequence can be inserted at the 3' end of the cDNA sequence encoding the
heterologous polypeptide. Thus, replacement means "replacement of expression"
and is not linked to an "exchange" of one gene for another in the genome.

In another preferred embodiment, the gene encoding the naturally
expressed protein is replaced with a DNA sequence, e.g., the genomic sequence,
encoding the heterologous polypeptide, e.g., the sequence encoding the naturally
expressed protein is removed and replaced with, e.g., the genomic sequence
which encodes the heterologous polypeptide. In a preferred embodiment, the
sequence encoding the heterologous protein is inserted: after the 5' untranslated
region (UTR) of the naturally expressed protein; prior to the 3' UTR of the
naturally expressed protein; in between the 5' UTR and the 3' UTR of the
naturally expressed protein.

In a preferred embodiment, the protein naturally expressed in the
cell is altered by changing the sequence of the gene, e.g., by adding, deleting, or
substituting a nucleotide, such that the protein is changed. The protein can be
changed by, for example, deleting at least one amino acid, adding at least one
amino acid, or substituting at least one amino acid. A nucleotide can be
added at the C-terminus, the N-terminus, or at internal points within the gene, e.g.,
to achieve the change in the protein. In a preferred embodiment, the protein
naturally expressed in the cell is changed by adding at least one amino acid to the
protein, e.g., an added amino acid or amino acids that can bind to a preselected
ligand, e.g., a 6X HIS ligand, a cellulose binding domain (CBD) ligand, a maltose
binding protein (MBP) ligand. In a preferred embodiment, the protein naturally
expressed in the cell is changed by addition of an affinity tag to the gene encoding
the naturally expressed protein. In a preferred embodiment, more than 1-100,
5-50, 10-30 amino acid residues can be added to the protein. The residues allow for
a physical difference between the altered protein and the heterologous product and
are useful, e.g., in purification.

In a preferred embodiment, the protein naturally expressed in the
cell is altered by changing the gene encoding the protein such that the stop codon
of the protein is altered, e.g., the stop codon is altered by substitution of at least
one nucleotide in the stop codon. In another preferred embodiment, the stop
codon is altered by removal of at least one nucleotide or addition of at least one
nucleotide in the sequence encoding the stop codon.

In a preferred embodiment, the protein naturally expressed in the
cell is altered by changing an amino acid sequence of the protein such that the
stop codon is altered, e.g., the stop codon is altered by addition of at least one
amino acid. In a preferred embodiment, the amino acid is added using a
suppressor tRNA and a transferase enzyme. In a preferred embodiment, the cell
includes one or more of: a sequence encoding a suppressor tRNA; a sequence
encoding a transferase enzyme; a sequence encoding a suppressor tRNA and a
sequence encoding a transferase enzyme; a sequence encoding a suppressor tRNA
under the control of a promoter; a transgenic sequence encoding a transferase
enzyme under the control of a promoter.

In a preferred embodiment, the cell is: a eukaryotic cell. In a preferred
embodiment, the cell is of fungal, plant or animal origin, e.g., vertebrate origin. In
a preferred embodiment, the cell is: a mammalian cell, e.g., a primary or
secondary mammalian cell, e.g., a fibroblast, a hematopoietic stem cell, a
myoblast, a keratinocyte, an epithelial cell, an endothelial cell, a glial cell, a neural
cell, a cell comprising a formed element of the blood, a muscle cell and precursors
of these somatic cells; a transformed or immortalized cell line. Preferably, the cell
is a human cell. In another embodiment, the cell line can be a cell line other than a
human cell line, e.g., a CHO cell line, a COS cell line.

In a preferred embodiment, the heterologous polypeptide is a
human polypeptide. In a preferred embodiment, the heterologous polypeptide is:
a serum protein, a milk protein, a glycosylated or a nonglycosylated protein. A
heterologous polypeptide includes, but is not limited to, any of: alpha proteinase
inhibitor, alkaline phosphatase, angiogenin, extracellular superoxide dismutase,
fibrinogen, glucocerebrosidase, glutamate decarboxylase, human serum albumin,
myelin basic protein, proinsulin, soluble CD4, lactoferrin, lactoglobulin,
lysozyme, lactalbumin, erythropoietin, tissue plasminogen activator, human
growth factor, antithrombin III, insulin, prolactin, and alpha-antitrypsin.

Also included in the invention are animals, sperm, oocytes, embryos, or
fetuses which include an altered version of a naturally expressed protein as
described herein. The invention further includes cell lines and DNA constructs as
described herein.

Another aspect of the invention features a method of evaluating an
affinity tag for its utility in separating a heterologous polypeptide and the protein
naturally expressed in the transgenic animal. The evaluation can be performed in
vivo or in vitro. In vitro methods can include engineering a gene comprising an
affinity tag by standard recombinant methods, obtaining a recombinantly tagged
protein and adding both the tagged protein and a heterologous polypeptide into a
sample, e.g., a bodily fluid (e.g., milk, blood, urine), and evaluating the level of
tagged protein that can be separated from the sample, to thereby evaluate the
utility of the tag.

The terms "protein", "polypeptide" and "peptide" are used interchangeably
herein.

As used herein, the term "transgenic sequence" refers to a nucleic acid
sequence (e.g., encoding one or more human proteins), which is inserted by
artifice into a cell. The transgenic sequence, also referred to herein as a transgene,
becomes part of the genome of an animal which develops in whole or in part from
that cell. In embodiments of the invention, the transgenic sequence is integrated
into the chromosomal genome. If the transgenic sequence is integrated into the
genome it results, merely by virtue of its insertion, in a change in the nucleic acid
sequence of the genome into which it is inserted. A transgenic sequence can be
partly or entirely species-heterologous, i.e., the transgenic sequence, or a portion
thereof, can be from a species which is different from the cell into which it is
introduced. A transgenic sequence can be partly or entirely species-homologous,
i.e., the transgenic sequence, or a portion thereof, can be from the same species as
is the cell into which it is introduced. If a transgenic sequence is homologous (in
the sequence sense or in the species-homologous sense) to an endogenous gene of
the cell into which it is introduced, then the transgenic sequence, preferably, has
one or more of the following characteristics: it is designed for insertion, or is
inserted, into the cell's genome in such a way as to alter the sequence of the
genome of the cell into which it is inserted (e.g., it is inserted at a location which
differs from that of the endogenous gene or its insertion results in a change in the
sequence of the endogenous gene); it includes a mutation, e.g., a mutation which
results in misexpression of the transgenic sequence; by virtue of its insertion, it
can result in misexpression of the gene into which it is inserted, e.g., the insertion
can result in a knockout of the gene into which it is inserted. A transgenic
sequence can include one or more transcriptional regulatory sequences and any
other nucleic acid sequences, such as introns, that may be necessary for a desired
level or pattern of expression of a selected nucleic acid, all operably linked to the
selected nucleic acid. The transgenic sequence can include an enhancer sequence
and/or sequences which allow for secretion.

As used herein, a "transgenic animal" is an animal, e.g., a non-human
mammal, e.g., a bovine, a goat, a horse, a sheep, a camel, a llama, a mouse, or a
rat, in which one or more, and preferably essentially all, of the cells of the animal
include a transgene. The transgene can be introduced into the cell, directly or
indirectly by introduction into a precursor of the cell, e.g., by microinjection,
transfection or infection, e.g., by infection with a recombinant virus, or by nuclear
transfer. The term genetic manipulation is directed to the introduction of a
recombinant DNA molecule. This molecule may be integrated within a
chromosome, or it may be extrachromosomally replicating DNA. Transgenic
animals can be, e.g., heterozygous or homozygous for a transgene.

Other features and advantages of the invention will be apparent from the
following description and from the claims.

Detailed Description of the Invention 

Methods of Altering a Protein Naturally Expressed in an Animal 

Transgenic animals can be produced which include an altered form of a
protein naturally expressed by the transgenic animal. Such animals can be used to
produce a heterologous polypeptide which can be separated from the host
animal's endogenous version of the protein.

The term "altered protein" or "altering a naturally expressed protein"
refers to a modification in the gene encoding the protein and/or the amino acid
sequence of the protein. Alterations can include, for example, alterations in the
gene sequence, or alterations in transcription or translation. As used herein, the
term "naturally expressed protein" refers to an endogenous protein of the host
animal. For example, bovine serum albumin is an endogenous protein expressed
in bovines.

Altering of the host endogenous gene can be done using molecular
biological techniques to first clone the genes and then construct a DNA molecule
that can be introduced into the host animal's cells.

This can be done by replacing the endogenous host gene with a sequence
encoding the heterologous polypeptide. The term "replacing" as used herein
refers to "replacement of expression" and is not linked to an "exchange" of one
gene for another in the genome. Thus, a DNA sequence, e.g., a cDNA, encoding a
heterologous polypeptide can be inserted prior to the initiation codon of the gene
encoding the naturally expressed protein with or without removing the sequence
coding for the naturally expressed protein from the genome. For example, a
cDNA encoding human serum albumin can be inserted into exon 2 of the bovine
serum albumin gene without removing the sequence coding for bovine serum
albumin for expression in a transgenic bovine. In another preferred embodiment,
the gene encoding the naturally expressed protein is exchanged with a DNA
sequence, e.g., the genomic sequence, encoding a heterologous polypeptide, e.g.,
the sequence encoding the naturally expressed protein is removed and replaced
with, e.g., the genomic sequence of a heterologous polypeptide. The sequence
encoding a heterologous polypeptide can be inserted: after the 5' untranslated
region (UTR) of the gene encoding the naturally expressed protein; prior to the 3'
UTR of the gene encoding the naturally expressed protein; and/or in between the
5' UTR and the 3' UTR of the gene encoding the naturally expressed protein.

In addition, when the sequence encoding the naturally expressed protein is
not removed from the host genome, other modifications can be made to the host's
genome. For example, if the host endogenous protein is secreted, the signal
sequence can be removed from the protein. In addition, a termination sequence,
e.g., a bovine polyA termination sequence, can be inserted onto the 3' end of the
cDNA encoding the heterologous polypeptide.

Another approach to altering a naturally expressed protein is to place a tag
on the endogenous protein which will allow it to be more easily purified away
from the transgenic product, i.e., the heterologous polypeptide. The endogenous
protein can be altered by changing the sequence of the gene, e.g., by adding,
deleting, or substituting a nucleotide, such that the protein is changed. The
naturally expressed protein can be changed by, for example, deleting at least one
amino acid, adding at least one amino acid, or substituting at least one
amino acid. A nucleotide can be added at the C-terminus, the N-terminus, or at
internal points within the gene, e.g., to achieve the change in the naturally
expressed protein.

One approach to changing the endogenous protein is to add at least one
amino acid to the protein, e.g., an added amino acid or amino acids that can bind
to a preselected ligand. For example, the endogenous protein can be changed by
addition of an affinity tag to the gene encoding the endogenous protein.
Preferably more than 1-100, 5-50, 10-30 amino acid residues can be added. The
residues allow for a physical difference between the altered protein and the
heterologous polypeptide. Examples of affinity tags include a 6X HIS tag, a
cellulose binding domain (CBD) tag, a maltose binding protein (MBP) tag and any
other peptide that would have a minimal effect on the host animal. An altered
protein which includes additional amino acids which can bind a preselected ligand
can be used to separate the altered protein from the sample. For example, an
altered protein can be separated from a sample by binding the altered protein to a
preselected ligand, e.g., a 6X HIS ligand (e.g., a metal chelating column), CBD
ligand (e.g., cellulose) or a MBP ligand (e.g., maltose).
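
By way of illustration only, the addition of a 6X HIS tag at the gene level can
be sketched in Python as follows. The sequence used is a hypothetical toy coding
sequence, not the bovine serum albumin gene, and the function name, the use of
the CAT codon for histidine, and the placement of the tag immediately before the
stop codon are assumptions made for the example.

```python
# Minimal sketch: add a C-terminal 6X HIS tag to a coding sequence.
# All names and sequences here are hypothetical and for illustration only.

HIS_CODON = "CAT"                      # CAT encodes histidine (CAC also does)
STOP_CODONS = {"TAA", "TAG", "TGA"}    # standard stop codons


def add_c_terminal_his_tag(cds: str, n_his: int = 6) -> str:
    """Insert n_his histidine codons immediately before the stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    body, stop = cds[:-3], cds[-3:]
    if stop not in STOP_CODONS:
        raise ValueError("coding sequence must end with a stop codon")
    return body + HIS_CODON * n_his + stop


# Toy coding sequence: ATG AAG TGG GTT TAA
toy_cds = "ATGAAGTGGGTTTAA"
tagged_cds = add_c_terminal_his_tag(toy_cds)
print(tagged_cds)
```

In this sketch the tagged sequence is 18 nucleotides longer than the input,
corresponding to six added histidine residues that could bind a metal chelating
column.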

Another approach to changing the endogenous protein is to modify the
gene encoding the protein such that the stop codon of the bovine serum albumin is
altered, e.g., the stop codon is altered by substitution of at least one nucleotide in
the stop codon. In another preferred embodiment, the stop codon is altered by
removal of at least one nucleotide or addition of at least one nucleotide encoding
the stop codon, e.g., removal of at least one nucleotide in the TAA stop codon of
the sequence encoding bovine serum albumin. The stop codon can also be altered,
at the protein level, by changing an amino acid sequence of the protein, e.g., the
stop codon is altered by addition of at least one amino acid. For example, an
amino acid can be added using a suppressor tRNA and a transferase enzyme. A
transgenic sequence encoding a suppressor tRNA can be expressed in the
transgenic mammal. In addition, a transgenic sequence encoding a transferase
enzyme can be expressed in a transgenic mammal. Preferably, a transgenic
sequence encoding either a suppressor tRNA or a transferase enzyme is under the
control of a tissue-specific promoter. Thus, in order to limit the effect of
suppressing all proteins which have the particular base codons when a suppressor
tRNA is used, a tissue-specific promoter can be used in order to limit expression
of the tRNA and transferase enzyme mostly to the desired organ. For example,
when the endogenous protein is bovine serum albumin which is produced in the
liver, a liver-specific promoter can be used to limit expression of the tRNA to the
liver.

Preferably, the suppressor tRNA can recognize at least 2, 3, 4, 5, 6 base
codons, e.g., the suppressor tRNA recognizes the TAAAC sequence at the end of
the bovine serum albumin coding region.
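
The effect of altering the stop codon at the gene level can be illustrated with
the following Python sketch, which is offered as an example only. The toy gene
is hypothetical and is not the bovine serum albumin sequence; it was chosen so
that its TAA stop codon is followed by AC, as in the TAAAC sequence mentioned
above. The sketch uses the standard genetic code and does not model suppressor
tRNAs.

```python
# Minimal sketch: altering a TAA stop codon extends the translated protein.
# The toy sequences are hypothetical; the codon table is the standard code.

BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def translate(dna: str) -> str:
    """Translate from position 0 until the first in-frame stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


coding = "ATGAAGTGG"                 # toy coding region (M K W)
downstream = "ACGGTCATTGAGCTGATT"    # toy 3' sequence after the stop codon

wild_type = coding + "TAA" + downstream
substituted = coding + "CAA" + downstream   # T -> C substitution in TAA
deleted = coding + "AA" + downstream        # removal of the T of TAA

print(translate(wild_type))     # MKW
print(translate(substituted))   # MKWQTVIELI
print(translate(deleted))       # MKWKRSLS
```

In both altered toy genes the protein gains additional C-terminal residues,
which is the kind of physical difference that could be used in purification.
Note that in a sequence where TAA is followed by an A, removing one of the A's
of the stop codon would recreate a TAA stop codon, which is why this sketch
removes the T instead.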

The methods of altering the naturally expressed protein can be done by
manipulating one copy of the endogenous gene, followed by nuclear transfer to
obtain an embryo, fetus or adult animal, which can be used to generate a cell line,
e.g., a fetal fibroblast cell line. Such cells can be used to produce a transgenic
animal that is heterozygous for the gene which encodes the altered polypeptide.
These cell lines can also be used to manipulate the other wild type copy of the
endogenous gene. The resulting cell line should then carry the altered endogenous
gene in both alleles. Such cells can be used to produce a transgenic animal that is
homozygous for the gene which encodes the altered protein.

Either of these cell lines can then be used in the nuclear transfer process to
generate the production herd of animals, e.g., cattle or goats, that not only produce
the heterologous polypeptide encoded by the transgene, e.g., in their milk, but also
carry the altered naturally expressed protein, e.g., in their circulation.

Transfected Cell Lines 

Genetically engineered cells for production of a transgenic mammal can be
obtained from a cell line into which a nucleic acid of interest, e.g., a nucleic acid
which encodes a protein, has been introduced.

A construct can be introduced into a cell via conventional transformation
or transfection techniques. As used herein, the terms "transfection" and
"transformation" include a variety of techniques for introducing a transgenic
sequence into a host cell, including calcium phosphate or calcium chloride
co-precipitation, DEAE-dextran-mediated transfection, lipofection, or
electroporation. In addition, biological vectors, e.g., viral vectors can be used as
described below. Suitable methods for transforming or transfecting host cells can
be found in Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd ed.,
Cold Spring Harbor Laboratory, (Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, NY, 1989), and other suitable laboratory manuals.

Two useful approaches are electroporation and lipofection. Brief
examples of each are described below.

The DNA construct can be stably introduced into a donor cell line, e.g., an
embryonic cell, e.g., an embryonic somatic cell line, by electroporation using the
following protocol: the cells are resuspended in PBS at about 4x10^ cells/ml.
Fifty micrograms of linearized DNA is added to the 0.5 ml cell suspension, and
the suspension is placed in a 0.4 cm electrode gap cuvette (Biorad).
Electroporation is performed using a Biorad Gene Pulser electroporator with a 330
volt pulse at 25 mA, 1000 microFarad and infinite resistance. If the DNA
construct contains a neomycin resistance gene for selection, neomycin resistant
clones are selected following incubation with 350 micrograms/ml of G418
(GibcoBRL) for 15 days.

The DNA construct can be stably introduced into a donor cell line by
lipofection using a protocol such as the following: about 2x10^ cells are plated
into a 3.5 cm diameter well and transfected with 2 micrograms of linearized DNA
using LipofectAMINE™ (GibcoBRL). Forty-eight hours after transfection, the
cells are split 1:1000 and 1:5000 and, if the DNA construct contains a neomycin
resistance gene for selection, G418 is added to a final concentration of 0.35
mg/ml. Neomycin resistant clones are isolated and expanded for
cryopreservation as well as nuclear transfer.
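
As a worked check of the quantities in the two example protocols, the following
Python sketch computes the DNA concentration in the electroporation cuvette and
converts the lipofection G418 concentration into micrograms per ml. It assumes
that the fifty micrograms of DNA are added to a 0.5 ml suspension and that the
350 microgram G418 figure in the electroporation protocol is per ml; the
function names are hypothetical.

```python
import math

# Minimal sketch of the unit arithmetic in the example protocols.
# The input values are taken from the protocols above under stated assumptions.


def concentration_ug_per_ml(mass_ug: float, volume_ml: float) -> float:
    """Concentration in micrograms per ml for a mass dissolved in a volume."""
    if volume_ml <= 0:
        raise ValueError("volume must be positive")
    return mass_ug / volume_ml


def mg_per_ml_to_ug_per_ml(value_mg_per_ml: float) -> float:
    """Convert mg/ml to micrograms/ml (1 mg = 1000 micrograms)."""
    return value_mg_per_ml * 1000.0


# Electroporation: 50 micrograms of linearized DNA in a 0.5 ml suspension.
dna_ug_per_ml = concentration_ug_per_ml(50.0, 0.5)

# Lipofection: G418 at a final concentration of 0.35 mg/ml.
g418_lipofection_ug_per_ml = mg_per_ml_to_ug_per_ml(0.35)

# Electroporation: G418 at 350 micrograms/ml (assumed to be per ml).
g418_electroporation_ug_per_ml = 350.0

same_selection = math.isclose(g418_lipofection_ug_per_ml,
                              g418_electroporation_ug_per_ml)
print(dna_ug_per_ml, g418_lipofection_ug_per_ml, same_selection)
```

Under these assumptions the two protocols apply the same G418 selection
concentration, expressed in different units.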

Transgenic Mammals 

Methods for generating non-human transgenic mammals are known in the
art. Such methods can involve introducing DNA constructs into the germ line of a
mammal to make a transgenic mammal. For example, one or several copies of the
construct may be incorporated into the genome of a mammalian embryo by
standard transgenic techniques.

Although bovines and goats are a preferred source of cells, other
non-human mammals can be used. Preferred non-human mammals are ruminants, e.g.,
cows, sheep, camels or goats. Additional examples of preferred non-human
animals include oxen, horses, llamas, pigs, mice and rats. The mammal used as
the source of cells, e.g., genetically engineered cell, will depend on the transgenic
mammal to be obtained. By way of an example, the genome from a bovine should
be used for nuclear transfer with a bovine oocyte.

Methods for the preparation of a variety of transgenic animals are known
in the art. Protocols for producing transgenic goats are known in the art. For
example, a transgene can be introduced into the germline of a goat by
microinjection as described, for example, in Ebert et al. (1994) Bio/Technology
12:699. A protocol for the production of a transgenic pig can be found in White
and Yannoutsos, Current Topics in Complement Research: 64th Forum in
Immunology, pp. 88-94; US Patent No. 5,523,226; US Patent No. 5,573,933; PCT
Application WO93/25071; and PCT Application WO95/04744. A protocol for
the production of a transgenic rat can be found in Bader and Ganten, Clinical and
Experimental Pharmacology and Physiology, Supp. 3:S81-S87, 1996. A protocol
for the production of a transgenic cow can be found in U.S. Patent No. 5,741,957,
and Transgenic Animal Technology, A Handbook, 1994, ed., Carl A. Pinkert,
Academic Press, Inc. A protocol for the production of a transgenic sheep can be
found in PCT Publication WO 97/07669, and Transgenic Animal Technology, A
Handbook, 1994, ed., Carl A. Pinkert, Academic Press, Inc.

Tissue-Specific Expression of Proteins

It is often desirable to express a protein, e.g., a heterologous protein, in a
specific tissue or fluid, e.g., the milk, blood or urine, of a transgenic animal. The
heterologous protein can be recovered from the tissue or fluid in which it is
expressed. For example, it is often desirable to express the heterologous protein
in milk. Methods for producing a heterologous protein under the control of a milk
specific promoter are described below. In addition, other tissue-specific
promoters, as well as other regulatory elements, e.g., signal sequences and
sequences which enhance secretion of non-secreted proteins, are described below.

Milk Specific Promoters 
Useful transcriptional promoters are those promoters that are preferentially
activated in mammary epithelial cells, including promoters that control the genes
encoding milk proteins such as caseins, beta lactoglobulin (Clark et al. (1989)
Bio/Technology 7: 487-492), whey acid protein (Gordon et al. (1987)
Bio/Technology 5: 1183-1187), and lactalbumin (Soulier et al., (1992) FEBS Letts.
297: 13). Casein promoters may be derived from the alpha, beta, gamma or kappa
casein genes of any mammalian species; a preferred promoter is derived from the
goat beta casein gene (DiTullio, (1992) Bio/Technology 10:74-77). Milk-specific
protein promoter or the promoters that are specifically activated in mammary
tissue can be derived from cDNA or genomic sequences. Preferably, they are
genomic in origin.

DNA sequence information is available for the mammary gland specific
genes listed above, in at least one, and often in several organisms. See, e.g.,
Richards et al. (1981) J. Biol. Chem. 256, 526-532 (α-lactalbumin rat); Campbell
et al. (1984) Nucleic Acids Res. 12, 8685-8697 (rat WAP); Jones et al. (1985) J.
Biol. Chem. 260, 7042-7050 (rat β-casein); Yu-Lee & Rosen (1983) J. Biol. Chem.
258, 10794-10804 (rat γ-casein); Hall, Biochem. J. 242, 735-742 (1987)
(α-lactalbumin human); Stewart, Nucleic Acids Res. 12, 389 (1984) (bovine αs1 and
κ casein cDNAs); Gorodetsky et al. (1988) Gene 66, 87-96 (bovine β casein);
Alexander et al. (1988) Eur. J. Biochem. 178, 395-401 (bovine κ casein); Brignon
et al. (1977) FEBS Lett. 138, 48-55 (bovine αS2 casein); Jamieson et al. (1987)
Gene 61, 85-90, Ivanov et al. (1988) Biol. Chem. Hoppe-Seyler 369, 425-429,
Alexander et al. (1989) Nucleic Acids Res. 17, 6739 (bovine β
lactoglobulin); Vilotte et al. (1987) Biochimie 69, 609-620 (bovine
α-lactalbumin). The structure and function of the various milk protein genes are
reviewed by Mercier & Vilotte (1993) J. Dairy Sci. 76, 3079-3098 (incorporated
by reference in its entirety for all purposes). If additional flanking sequences are
useful in optimizing expression of the heterologous protein, such sequences can be
cloned using the existing sequences as probes. Mammary-gland specific
regulatory sequences from different organisms can be obtained by screening
libraries from such organisms using known cognate nucleotide sequences, or
antibodies to cognate proteins as probes.

Signal Sequences

Useful signal sequences are milk-specific signal sequences or other signal
sequences which result in the secretion of eukaryotic or prokaryotic proteins.
Preferably, the signal sequence is selected from milk-specific signal sequences,
i.e., it is from a gene which encodes a product secreted into milk. Most
preferably, the milk-specific signal sequence is related to the milk-specific
promoter used in the construct, which are described below. The size of the signal
sequence is not critical. All that is required is that the sequence be of a sufficient
size to effect secretion of the desired recombinant protein, e.g., in the mammary
tissue. For example, signal sequences from genes coding for caseins, e.g., alpha,
beta, gamma or kappa caseins, beta lactoglobulin, whey acid protein, and
lactalbumin can be used. A preferred signal sequence is the goat β-casein signal
sequence.

Signal sequences from other secreted proteins, e.g., proteins secreted by
kidney cells, pancreatic cells or liver cells, can also be used. Preferably, the signal
sequence results in the secretion of proteins into, for example, urine or blood.

Amino-Terminal Regions of Secreted Proteins

A non-secreted protein can also be modified in such a manner that it is
secreted such as by inclusion in the protein to be secreted of all or part of the
coding sequence of a protein which is normally secreted. Preferably the entire
sequence of the protein which is normally secreted is not included in the sequence
of the protein but rather only a sufficient portion of the amino terminal end of the
protein which is normally secreted to result in secretion of the protein. For
example, a protein which is not normally secreted is fused (usually at its amino
terminal end) to an amino terminal portion of a protein which is normally
secreted.

In one aspect, the protein which is normally secreted is a protein which is
normally secreted in milk. Such proteins include proteins secreted by mammary
epithelial cells, milk proteins such as caseins, beta lactoglobulin, whey acid
protein, and lactalbumin. Casein proteins include alpha, beta, gamma or kappa
casein genes of any mammalian species. A preferred protein is beta casein, e.g.,
goat beta casein. The sequences which encode the secreted protein can be derived
from either cDNA or genomic sequences. Preferably, they are genomic in origin,
and include one or more introns.

Other Tissue-Specific Promoters 

Other tissue-specific promoters which provide expression in a particular
tissue can be used. Tissue specific promoters are promoters which are expressed
more strongly in a particular tissue than in others. Tissue specific promoters are
often expressed essentially exclusively in the specific tissue. For example, if the
altered protein is normally expressed in the liver, a liver-specific promoter can be
used. For example, a liver-specific promoter can be used when a suppressor
tRNA is used to alter serum albumin. In this situation, a transgenic sequence
encoding the suppressor tRNA can be under the control of a liver-specific
promoter.

Tissue-specific promoters which can be used include: a neural-specific
promoter, e.g., nestin, Wnt-1, Pax-1, Engrailed-1, Engrailed-2, Sonic hedgehog; a
liver-specific promoter, e.g., albumin, alpha-1 antitrypsin; a muscle-specific
promoter, e.g., myogenin, actin, MyoD, myosin; an oocyte-specific promoter, e.g.,
ZP1, ZP2, ZP3; a testes-specific promoter, e.g., protamine, fertilin, synaptonemal
complex protein-1; a blood-specific promoter, e.g., globulin, GATA-1,
porphobilinogen deaminase; a lung-specific promoter, e.g., surfactant protein C; a
skin- or wool-specific promoter, e.g., keratin, elastin; endothelium-specific
promoters, e.g., Tie-1, Tie-2; and a bone-specific promoter, e.g., BMP.

In addition, general promoters can be used for expression in several
tissues. Examples of general promoters include β-actin, ROSA-21, PGK, FOS,
c-myc, Jun-A, and Jun-B.

Insulator Sequences

The DNA constructs used to make a transgenic animal can include at least
one insulator sequence. The terms "insulator", "insulator sequence" and
"insulator element" are used interchangeably herein. An insulator element is a
control element which insulates the transcription of genes placed within its range
of action but which does not perturb gene expression, either negatively or
positively. Preferably, an insulator sequence is inserted on either side of the DNA
sequence to be transcribed. For example, the insulator can be positioned about
200 bp to about 1 kb, 5' from the promoter, and at least about 1 kb to 5 kb from
the promoter, at the 3' end of the gene of interest. The distance of the insulator
sequence from the promoter and the 3' end of the gene of interest can be
determined by those skilled in the art, depending on the relative sizes of the gene
of interest, the promoter and the enhancer used in the construct. In addition, more
than one insulator sequence can be positioned 5' from the promoter or at the 3'
end of the transgene. For example, two or more insulator sequences can be
positioned 5' from the promoter. The insulator or insulators at the 3' end of the
transgene can be positioned at the 3' end of the gene of interest, or at the 3' end of
a 3' regulatory sequence, e.g., a 3' untranslated region (UTR) or a 3' flanking
sequence.

A preferred insulator is a DNA segment which encompasses the 5' end of
the chicken β-globin locus and corresponds to the chicken 5' constitutive
hypersensitive site as described in PCT Publication 94/23046, the contents of
which are incorporated herein by reference.

DNA Constructs

A cassette which encodes a heterologous protein can be assembled as a
construct which includes a promoter, e.g., a promoter for a specific tissue, e.g., for
mammary epithelial cells, e.g., a casein promoter, e.g., a goat beta casein
promoter, a milk-specific signal sequence, e.g., a casein signal sequence, and a
DNA encoding the heterologous protein.

The construct can also include a 3' untranslated region downstream of the
DNA sequence coding for the non-secreted protein. Such regions can stabilize the
RNA transcript of the expression system and thus increase the yield of desired
protein from the expression system. Among the 3' untranslated regions useful in
the constructs for use in the invention are sequences that provide a poly A signal.
Such sequences may be derived, e.g., from the SV40 small t antigen, the casein 3'
untranslated region or other 3' untranslated sequences well known in the art. In
one aspect, the 3' untranslated region is derived from a milk-specific protein. The
length of the 3' untranslated region is not critical, but the stabilizing effect of its
poly A transcript appears important in stabilizing the RNA of the expression
sequence.

Optionally, the construct can include a 5' untranslated region between the
promoter and the DNA sequence encoding the signal sequence. Such untranslated
regions can be from the same control region from which the promoter is taken or can
be from a different gene, e.g., they may be derived from other synthetic,
semi-synthetic or natural sources. Again, their specific length is not critical; however,
they appear to be useful in improving the level of expression.

The construct can also include about 10%, 20%, 30%, or more of the
N-terminal coding region of a gene preferentially expressed in mammary epithelial
cells. For example, the N-terminal coding region can correspond to the promoter
used, e.g., a goat β-casein N-terminal coding region.

The construct can be prepared using methods known in the art. The
construct can be prepared as part of a larger plasmid. Such preparation allows the
cloning and selection of the correct constructions in an efficient manner. The
construct can be located between convenient restriction sites on the plasmid so
that it can be easily isolated from the remaining plasmid sequences for
incorporation into the desired mammal.






WO 00/61725 PCT/US00/10095



Heterologous Proteins 

Transgenic sequences encoding heterologous proteins can be introduced
into the germline of a non-human mammal or can be transfected into a cell line as
described above. The protein can be a complex or multimeric protein, e.g., a
homo- or heteromultimer, e.g., proteins which naturally occur as homo- or
heteromultimers, e.g., homo- or hetero- dimers, trimers or tetramers. The protein
can be a protein which is processed by removal, e.g., cleavage, of N-terminus,
C-terminus or internal fragments. Even complex proteins can be expressed in active
form. Protein encoding sequences which can be introduced into the genome of a
mammal, e.g., bovines or goats, include a serum protein, a milk protein, a
glycosylated or a non-glycosylated protein. The protein may be human or
non-human in origin. The heterologous protein may be a potential therapeutic or
pharmaceutical agent such as, but not limited to: alpha-1 proteinase inhibitor,
alpha-1 antitrypsin, alkaline phosphatase, angiogenin, antithrombin III, any of
the blood clotting factors including Factor VIII, Factor IX, and Factor X, chitinase,
erythropoietin, extracellular superoxide dismutase, fibrinogen,
glucocerebrosidase, glutamate decarboxylase, human growth factor, human serum
albumin, immunoglobulin, insulin, myelin basic protein, proinsulin, prolactin,
soluble CD4 or a component or complex thereof, lactoferrin, lactoglobulin,
lysozyme, lactalbumin, tissue plasminogen activator or a variant thereof.

Nucleotide sequence information is available for several of the genes
encoding the heterologous proteins listed above, in at least one, and often in
several organisms. See e.g., Long et al. (1984) Biochem. 23(21):4828-4837
(alpha-1 antitrypsin); Mitchell et al. (1986) Proc. Natl. Acad. Sci. USA 83:7182-
7186 (alkaline phosphatase); Schneider et al. (1988) EMBO J. 7(13):4151-4156
(angiogenin); Bock et al. (1988) Biochem. 27(16):6171-6178 (antithrombin III);
Olds et al. (1991) Br. J. Haematol. 78(3):408-413 (antithrombin III); Lin et al.
(1985) Proc. Natl. Acad. Sci. USA 82(22):7580-7584 (erythropoietin); U.S. Patent
No. 5,614,184 (erythropoietin); Horowitz et al. (1989) Genomics 4(1):87-96
(glucocerebrosidase); Kelly et al. (1992) Ann. Hum. Genet. 56(3):255-265
(glutamate decarboxylase); U.S. Patent No. 5,707,828 (human serum albumin);
U.S. Patent No. 5,652,352 (human serum albumin); Lawn et al. (1981) Nucleic
Acid Res. 9(22):6103-6114 (human serum albumin); Kamholz et al. (1986) Proc.
Natl. Acad. Sci. USA 83(13):4962-4966 (myelin basic protein); Hiraoka et al.
(1991) Mol. Cell Endocrinol. 75(1):71-80 (prolactin); U.S. Patent No. 5,571,896
(lactoferrin); Pennica et al. (1983) Nature 301(5897):214-221 (tissue plasminogen
activator); Sarafanov et al. (1995) Mol. Biol. 29:161-165, the contents of which
are incorporated herein by reference.

Examples 

Replacement of bovine serum albumin (BSA) with human serum albumin (HSA)

One method of reducing the contaminating BSA in milk is to replace it
with the human version, i.e., HSA. Since the human and bovine serum albumin
proteins are very similar and since they do not carry out enzymatic functions,
human serum albumin can replace BSA in the animal.

Serum albumin is normally produced in the liver and is found at high
levels in the serum. In this series of experiments, the bovine gene that codes for
BSA is replaced by the gene encoding the human version. The resulting animals
produce high levels of HSA in their bloodstream. Any serum albumin that then
leaks into the mammary gland is HSA. Since serum albumin has no glycosylation
sites, the HSA produced in the liver and found in the serum is identical to that
produced in the mammary gland by the milk-specific promoters. This process
requires cloning the genes for human and bovine serum albumin. The human
gene is then introduced into the bovine genome in a manner to replace the cow's
own gene. This process takes place in cells grown in culture that are eventually
used to generate live cattle by a process of nuclear transfer.



Cloning of the cDNA sequence for BSA

The sequence of BSA mRNA is published in GenBank as M73993, which
contains 2035 bp of sequence. The mRNA is likely to be identical in different
lines of cattle. The cDNA from a commercial bovine library can be cloned and
sequenced to confirm that it is correct. The cDNA can then be used in generating
both BSA Tag versions and in the replacement process.

Cloning of BSA genomic sequence

The cDNA sequence can be used as a probe to clone the genomic version
of BSA. Although the sequence coding for the BSA protein should be identical,
the level of heterogeneity of the intron and flanking regions can be determined
and used as a target sequence for homologous recombination. Preferably, the cell
line is one that has already given rise to an animal that produces HSA in its milk.
The gene from founder animals which were derived by microinjection of a
transgene and which carry the construct enabling them to produce HSA in their
milk can be used.

Construction of HSA cDNA in Exon I of BSA

One means of replacing the BSA is to place the cDNA sequence encoding
HSA just 5' of the initiating codon of BSA, which is found in Exon 2 of the gene.
By using oligomutagenesis, an XhoI site can be introduced into Exon 2. In
addition, the signal sequence of BSA can be removed. For example, a construct
which includes a cDNA version of HSA that carries an optimized Kozak sequence
and is contained in a 1.5 kb XhoI fragment can be used. Earlier work has shown
that this cDNA sequence can be expressed at high levels in transgenic animals.
Transcription occurs through this hybrid gene and the HSA is secreted, but the
BSA coding sequence, which has been disrupted, is not.

In order to more fully ensure no expression of the BSA gene, the bovine
bgh polyA termination sequence can be inserted on the 3' end of the inserted HSA
cDNA. This should end transcription at the end of the HSA gene. The remaining
portion of BSA is not transcribed and, therefore, is silent.

Construction of HSA replacement of BSA coding sequence

An alternative method of HSA replacement of BSA is to replace the
coding region of the genomic BSA with the genomic version of HSA. This
method includes introduction of the same unique cloning sites into the
untranslated region (UTR) of both genomic HSA and genomic BSA. The coding
sequence of HSA is then cloned into the corresponding regions of BSA. The
resulting vectors, therefore, contain the coding region of HSA flanked by the UTR
of BSA. These BSA UTRs can be used for homology to carry out recombination
in which the genomic coding region of BSA is replaced by HSA. This gene
construct can be used for high-level expression by the endogenous bovine
promoter.

Addition of an Affinity Tag to BSA

Since it is often necessary to remove the BSA from the process to such a
low level, an affinity tag can be placed on this protein that allows for tight binding
in the presence of HSA. An example of such an affinity tag is a 6X HIS tag,
which can be used to bind proteins at nanogram levels. Proteins with this tag can
be isolated by their ability to bind to a metal chelating column. Other affinity tags
can be utilized as well. Examples of such tags include cellulose binding domain
(CBD), maltose binding protein (MBP), or any peptide that would have minimal
effect on the host animal.

BSA 6X HIS Fusion

Since the sequence of BSA is known, it is possible to design
oligonucleotides that can be used to add a 6X HIS tag to the C-terminus of the
BSA coding region. Once the cDNA of BSA has been cloned, the 6X HIS tag is
ligated to the C-terminus using standard techniques.
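
The sequence-level result of the tagging step can be sketched in Python. This is only an illustration under stated assumptions: the helper name is hypothetical, the tag is assumed to be encoded by six CAC codons (CAT also encodes histidine), and the input is a toy coding sequence, not the actual BSA cDNA. In the laboratory the same design would be carried on the oligonucleotides rather than computed.

```python
# Sketch: place six histidine codons immediately before the stop codon
# of a coding sequence (CDS), giving a C-terminal 6X HIS tag.
# The function name, the CAC codon choice and the toy CDS are assumptions.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def add_c_terminal_his_tag(cds: str, his_codon: str = "CAC", copies: int = 6) -> str:
    """Return the CDS with `copies` histidine codons inserted before its stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    stop = cds[-3:]
    if stop not in STOP_CODONS:
        raise ValueError("coding sequence must end with a stop codon")
    # Keep everything up to the stop codon, add the tag, then restore the stop.
    return cds[:-3] + his_codon * copies + stop


toy_cds = "ATGAAGTGGGTGTAA"  # toy example only, not the BSA sequence
tagged = add_c_terminal_his_tag(toy_cds)
print(tagged)
```

Keeping the original stop codon after the tag means the tagged protein ends right after the sixth histidine, so the tag stays in frame with the BSA coding region.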

Expression of the BSA Tag 

The BSA Tag can be expressed both in tissue culture and in the milk of
transgenic mice. This allows the ability of the new version of BSA to bind to
metal chelating columns to be tested. It also can be used to show that the BSA 6X
HIS Tag is capable of being expressed from mammalian cells.

Test for HSA protein binding to Chelating Column

Human serum albumin protein can be tested for its binding capacity to the
metal chelating columns.

Comparison of HSA and BSA Tag 

The experiment can be used to compare the relative binding of the two
proteins in a purification process. The 6X HIS tag allows for removal of the BSA
from a solution of HSA to levels required by FDA standards (1 ppm). Briefly,
HSA is added to cow milk at 10 g/L. The BSA Tag is added to the mixture to 0.2
g/L. A mock purification can be carried out, similar to the processes already
described for HSA purification. Using such a method, the level of the BSA Tag can
be tested to determine if it can be removed to less than the 1 ppm level.
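
The scale of clearance this mock purification must demonstrate can be estimated with a short calculation. The sketch below assumes that ppm means mass of BSA per mass of HSA, that the BSA Tag spike is 0.2 g/L, and that HSA recovery is complete; the function name is hypothetical.

```python
# Sketch: fold reduction of BSA Tag needed to reach a ppm limit relative to HSA.
# Assumptions: ppm = (mass BSA / mass HSA) * 1e6, and no HSA is lost in processing.


def required_fold_reduction(hsa_g_per_l: float, bsa_g_per_l: float, limit_ppm: float) -> float:
    """Return how many fold the BSA level must drop to meet limit_ppm."""
    start_ppm = bsa_g_per_l / hsa_g_per_l * 1_000_000
    return start_ppm / limit_ppm


fold = required_fold_reduction(hsa_g_per_l=10.0, bsa_g_per_l=0.2, limit_ppm=1.0)
print(f"Starting level: {0.2 / 10.0 * 1e6:,.0f} ppm; about {fold:,.0f}-fold removal needed")
```

Under these assumptions the spiked milk starts at roughly 20,000 ppm BSA relative to HSA, so the process would need to remove the BSA Tag by about four orders of magnitude.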

Addition of CBD fusion tag

Another tag that can be added to BSA is the affinity protein for cellulose
binding, CBD. The 15 kD portion of the enzyme for cellulose degradation contains
the region of the enzyme that specifically binds to cellulose.

CBD BSA cDNA Construct

Utilizing the cDNA for BSA, the coding region of the CBD protein
fragment can be linked in frame to the BSA gene. A version of the CBD protein
has been designed that utilizes the codon preference of the mammary gland. In
addition, the glycosylation sites have been removed, allowing more effective
binding to cellulose. A CBD fusion of this peptide has been successfully
expressed in both tissue culture and in the milk of transgenic mice. Using
standard techniques, the CBD gene can be linked in frame to the coding region of
BSA, such that a fusion protein can be produced.

Expression of BSA-CBD fusion

This protein can be expressed in both tissue culture and in milk of
transgenic mice. It is expected that the protein can specifically bind to cellulose.
This can be tested by the use of cellulose beads to specifically bind the fusion
protein. It is important that a cellulose process can scrub all of the BSA-CBD out
of a milk solution that contains high levels of HSA. It is also necessary to show
that the fusion protein is capable of being expressed at high levels. Since this
fusion protein is replacing the animal's normal serum albumin, it is necessary to
have it expressed as readily as BSA.

Effectiveness of Binding

The BSA-CBD protein can be produced in both tissue culture and in milk. The
protein can be tested for its ability to bind to cellulose in the presence of high
levels of HSA. This can be done on a pilot scale using low levels of BSA-CBD.
Assays can be used which detect ppm levels of BSA in the presence of HSA. The
cellulose needs to remove the BSA-CBD to these levels.

Changing the BSA protein (BSA)

Another method of changing the characteristics of BSA to make it more
easily separated or purified from HSA is to add a peptide tail. Due to limitations
on the size of DNA that can be used in homologous recombination, very small
segments of DNA may be needed to effect changes in the BSA molecule. Since
the sequence of BSA and its 3' UTR is known, this can be accomplished by
eliminating the stop codon TAA. This gives rise to an extended version of BSA
that is translated until the next stop codon is reached. Depending upon which
reading frame is chosen, the downstream fusion peptide can be of three different
types. They are 11, 20 or 45 additional amino acids in size. These forms
of BSA are altered in their physical properties such that they are more easily
purified from the HSA.
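
How the choice of reading frame sets the length of the added tail can be illustrated with a small Python sketch. It uses the standard genetic code and a toy 3' UTR, not the actual BSA 3' UTR, so it does not reproduce the 11, 20 and 45 residue tails reported above. The helper name and the `shift` parameter (the number of extra bases removed together with the stop codon) are assumptions made for illustration.

```python
# Sketch: translate a 3' UTR in each of three frames until the next stop codon,
# mimicking read-through after the stop codon has been deleted.
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def readthrough_extension(utr: str, shift: int) -> str:
    """Return the peptide read from `utr` starting `shift` bases in, up to the first stop."""
    utr = utr.upper()
    peptide = []
    for i in range(shift, len(utr) - 2, 3):
        residue = CODON_TABLE[utr[i:i + 3]]
        if residue == "*":  # next in-frame stop codon ends the extension
            break
        peptide.append(residue)
    return "".join(peptide)


toy_utr = "GCTGAATGGTAACCTATGA"  # toy sequence only, not the BSA 3' UTR
for shift in range(3):
    tail = readthrough_extension(toy_utr, shift)
    print(f"frame shift {shift}: {len(tail)} added residues ({tail or 'none'})")
```

Running the same function on the real 3' UTR sequence would show which frame gives which tail length, which could help decide how many bases to remove along with the stop codon.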

Testing the Extended BSA 

The cDNA sequence of BSA is published and the clone can be obtained.
Using oligo mutagenesis, the TAA stop codon can be eliminated along with the
number of bp needed to put the downstream sequence into each of the three reading
frames. These three extended BSA molecules can then be expressed in tissue
culture.

Separation Characteristics

Each version can be tested for its purification characteristics. It is
preferable that the new BSA is separated from the HSA during the processing
steps, thereby eliminating the need for a step that specifically removes the BSA.

Effects on the host animal

Another important aspect is whether the BSA is easily secreted. Since this
version of BSA is the only one that the animal carries, it is important that it be
capable of expression at the same level as normal BSA.

Animals can be produced that will first carry one altered allele. These
animals can be monitored to test any effects due to altered BSA. In addition, the
gene can be used to generate transgenic mice that overexpress the protein in their
liver. These animals can also be tested for effects due to altered BSA.

Dominant Change in BSA

Due to the time required to generate homozygous cattle, an alternative can
be attempted to alter the BSA by the introduction of additional genes that will
function to change the BSA expression. This dominant phenotype can
significantly change the timelines, in that only heterozygous animals would be
needed to start production. The TAA stop codon can be changed utilizing a
suppressor tRNA.

Suppressor tRNA

Suppressor tRNA molecules function by introducing an amino acid at a
stop codon. They have been extensively studied in bacterial genetics and are
being developed to function in mammalian cells. The tRNA also requires its
complementary amino acid transferase enzyme that adds the amino acid to the
tRNA. Since this protein is not present in mammalian cells, this gene must be
introduced for the suppressor tRNA to function.

Suppressor tRNA to produce BSA Extended

An extended version of BSA can be produced if the stop codon were
removed from the BSA gene. A suppressor tRNA specific to that TAA stop
codon can accomplish that by introducing an amino acid at that site. If the
tRNA suppressor and its corresponding transferase are expressed at high levels in
the animal's liver, the BSA produced may be of the BSA Extended form. This
requires dominant expression of these new genes such that all of the BSA mRNA
produced in the liver is acted upon by the suppressor tRNA. The suppression of
stop codons in the organism may have adverse effects. This can be avoided by
limiting expression of the suppressor tRNA only to required organs. Since BSA is
produced in the liver, the additional genes can be expressed from a liver-specific
promoter, e.g., the BSA promoter.

Modifying the Suppressor tRNA 

There are many proteins expressed in the liver, and certainly many may
have TAA as their stop codon. In order to limit the effect of suppressing all of
these TAA stop codons, the suppressor tRNA can be altered. It is known that
tRNAs can be changed so that they recognize 2 or 4 base codons instead of the
normal triplet. Since the sequence at the end of the BSA coding region is
TAAAC, a tRNA that can recognize the TAAAC sequence can be designed. This
provides more specific suppression to the BSA.
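
The gain in specificity from recognizing TAAAC rather than TAA alone can be illustrated by checking which transcripts carry the extended motif at their stop site. The sketch below uses invented toy transcripts and a hypothetical helper name; it says nothing about real liver transcripts.

```python
# Sketch: find transcripts whose stop codon plus the following bases match
# an extended motif such as TAAAC. All sequences here are toy examples.


def matches_extended_stop(transcript: str, cds_length: int, extended: str = "TAAAC") -> bool:
    """True if the stop codon ending the CDS, with downstream bases, equals `extended`."""
    stop_start = cds_length - 3  # index of the first base of the stop codon
    return transcript.upper()[stop_start:stop_start + len(extended)] == extended


# name -> (sequence of CDS followed by some 3' UTR, CDS length including stop codon)
toy_transcripts = {
    "bsa_like": ("ATGGCCTAAACGT", 9),   # TAA followed by AC
    "other_taa": ("ATGGGGTAAGTCC", 9),  # TAA followed by GT
    "tga_stop": ("ATGCCCTGAACAA", 9),   # different stop codon
}

targeted = [name for name, (seq, n) in toy_transcripts.items()
            if matches_extended_stop(seq, n)]
print(targeted)
```

As a rough guide only, if the two bases after a TAA stop were random and equally likely, about 1 in 16 TAA stop sites would also match TAAAC, so the extended recognition would be expected to spare most other TAA-terminated transcripts.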

Test Model for tRNA Suppression

It is necessary to test for a suppressor's effectiveness in a model system.
To do this, the genes can be expressed in the mammary gland of the mouse. These
genes can be linked to the milk promoters. Transgenic mice that carry the BSA,
the suppressor tRNA, and its transferase enzyme can be generated. When these
three genes are expressed, the effectiveness of suppression of the BSA can be
determined. The relative expression of each of the genes, as well as the
proportion of BSA Extended produced, can be measured and compared to the
normal BSA.

All patents and other references cited herein are hereby incorporated by
reference.

What is claimed:

1. A method of providing a heterologous polypeptide expressed by a
transgenic mammal, comprising:

providing a transgenic mammal which comprises a transgenic sequence
encoding the heterologous polypeptide and which expresses an altered form of the
protein naturally expressed by the transgenic mammal; and

obtaining a sample from the transgenic mammal, wherein the sample
comprises the heterologous polypeptide and the altered protein;

thereby providing the heterologous polypeptide.

2. The method of claim 1, further comprising separating the altered
protein from the sample.

3. The method of claim 2, wherein the sample is milk.

4. The method of claim 1, wherein the transgenic mammal is
homozygous for the gene which encodes the altered protein.

5. The method of claim 1, wherein the transgenic animal is
heterozygous for the gene which encodes the altered protein.

6. The method of claim 1, wherein the protein naturally expressed in
the transgenic mammal is altered by replacing the gene encoding the protein with
a DNA sequence encoding the heterologous polypeptide.

7. The method of claim 6, wherein the DNA sequence encoding the
heterologous polypeptide is inserted prior to the initiation codon of the gene
encoding the naturally expressed protein with or without removing the sequence
coding for the naturally expressed protein from the genome.

8. The method of claim 7, wherein a signal sequence of the naturally
expressed protein is removed.

9. The method of claim 7, wherein a termination sequence is inserted
at the 3' end of the DNA sequence encoding the heterologous polypeptide.

10. The method of claim 1, wherein the sequence encoding the
naturally expressed protein is removed and replaced with a DNA sequence which
encodes the heterologous polypeptide.

11. The method of claim 1, wherein the protein naturally expressed in
the transgenic mammal is altered by adding, deleting, or substituting a nucleotide,
such that the protein is changed.

12. The method of claim 11, wherein the protein naturally expressed in
the transgenic mammal can be changed by deleting at least one amino acid, adding
at least one amino acid, or substituting at least one amino acid.

13. The method of claim 12, wherein at least one amino acid is added
and the added amino acid or amino acids can bind to a preselected ligand.

14. The method of claim 13, wherein the preselected ligand is selected
from the group consisting of: a 6X HIS ligand, a cellulose binding domain (CBD)
ligand, and a maltose binding protein (MBP) ligand.

15. The method of claim 12, wherein the protein naturally expressed in
the transgenic mammal is changed by addition of an affinity tag to the gene
encoding the naturally expressed protein.

16. The method of claim 1, wherein the protein naturally expressed in
the transgenic mammal is altered by changing the gene encoding the protein such
that the stop codon of the protein is altered.

17. The method of claim 16, wherein the stop codon is altered by
substitution of at least one nucleotide in the stop codon.

18. The method of claim 16, wherein the stop codon is altered by
removal of at least one nucleotide or addition of at least one nucleotide in the
sequence encoding the stop codon.

19. The method of claim 12, wherein the protein naturally expressed in
the transgenic mammal is altered by changing an amino acid sequence of the
protein such that the stop codon is altered.

20. The method of claim 19, wherein the stop codon is altered by
addition of at least one amino acid.

21. The method of claim 20, wherein the amino acid is added using a
suppressor tRNA and a transferase enzyme.

22. The method of claim 21, wherein the transgenic mammal
comprises a transgenic sequence encoding a suppressor tRNA.

23. The method of claim 21, wherein the transgenic mammal
comprises a transgenic sequence encoding a transferase enzyme.

24. The method of claim 1, wherein the transgenic mammal is selected
from the group consisting of a goat, a bovine, a sheep, a pig, a horse, a mouse, and
a rat.

25. The method of claim 1, wherein the transgenic mammal is a cow.

26. The method of claim 1, wherein the transgenic mammal is a goat.

27. The method of claim 1, wherein the heterologous polypeptide is a
human polypeptide.

28. The method of claim 1, wherein the heterologous polypeptide is
selected from the group consisting of: alpha proteinase inhibitor, alkaline
phosphatase, angiogenin, extracellular superoxide dismutase, fibrinogen,
glucocerebrosidase, glutamate decarboxylase, human serum albumin, myelin basic
protein, proinsulin, soluble CD4, lactoferrin, lactoglobulin, lysozyme,
lactalbumin, erythropoietin, tissue plasminogen activator, human growth factor,
antithrombin III, insulin, prolactin, and alpha-antitrypsin.

29. The method of claim 1, wherein the heterologous polypeptide is
human serum albumin.

30. The method of claim 1, wherein the heterologous sequence is under
the control of a tissue-specific promoter.

31. The method of claim 30, wherein the promoter is a milk-specific
promoter.

32. The method of claim 1, wherein the milk-specific promoter is
selected from the group consisting of: a casein promoter, a beta lactoglobulin
promoter, a whey acid protein promoter, and a lactalbumin promoter.